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(54) System, device, and method for non-disruptive shutdown in a communications network 



(57) When a network node decides to stop routing 
protocol messages, the network node signals that it is 
not operational, but continues routing protocol messag- 



es for a predetermined timeout period. The network 
node may stop routing protocol messages after the pre- 
determined timeout period. 
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Description 

FIELD OF THE INVENTION 

[0001] The present invention relates generally to 
communication systems, and more particularly to non- 
disruptive shutdown in a communication network. 

BACKGROUND OF THE INVENTION 

[0002] In today's information age, computers and 
computer peripherals are often internetworked over a 
communication network. The communication network 
includes a number of network nodes that interoperate 
to route protocol messages within the communication 
network. These network nodes typically run various 
routing protocols in order to determine forwarding paths 
for routing protocol messages within the communication 
network. 

[0003] When a network node fails or otherwise be- 
comes non-operational, the other network nodes need 
to route the protocol messages around the failed net- 
work node. The network nodes typically rely on "keep- 
alive" messages to determine whether a particular net- 
work node is operational. A network node may be con- 
sidered to be operational as long as it is sending "keep- 
alive" messages to its neighboring network nodes or is 
responding to "keep-alive" messages sent by its neigh- 
boring network nodes, as determined by the particular 
routing protocol employed by the various network 
nodes. 

[0004] When a network node stops sending or re- 
sponding to "keep-aiive" messages (which is referred to 
hereinafter as "shutdown"), it takes some amount of 
time before the other network nodes detect that the net- 
work node has stopped sending or responding to "keep- 
alive" messages and begin routing protocol messages 
around the network node (referred to as "reconver- 
gence"). Therefore, the other network nodes continue 
routing protocol messages to the network node for some 
amount of time after the network node stops sending or 
responding to "keep-alive" messages. Because the net- 
work node is not operational, these protocol messages 
are simply dropped. 

SUMMARY OF THE INVENTION 

[0005] In accordance with one aspect of the invention, 
a network node signals that it is not operational and con- 
tinues routing protocol messages for a predetermined 
timeout period upon deciding to stop routing protocol 
messages. This allows the other network nodes to de- 
tect that the network node is not operational and begin 
routing protocol messages around the network node be- 
fore the network node stops routing protocol messages. 
In this way, the network node is able to shut down with- 
out disrupting protocol message routing within the com- 
munication network. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0006] The foregoing and other objects and advantag- 
es of the invention will be appreciated more fully from 
5 the following further description thereof with reference 
to the accompanying drawings wherein: 

FIG. 1 is a logic flow diagram showing exemplary 
logic to stop routing protocol messages in accord- 
ance with an embodiment of the present invention; 
FIG. 2 is a block diagram showing an exemplary 
network node in accordance with an embodiment 
of the present invention; and 
FIG, 3 is a system diagram showing an exemplary 
communication system in accordance with an em- 
bodiment of the present invention. 

DETAILED DESCRIPTION OF A PREFERRED 
EMBODIMENT 

[0007] In an embodiment of the invention, a particular 
network node (referred to hereinafter as the "shutdown" 
network node in order to differentiate it from the other 
network nodes) stops sending or responding to "keep- 
alive" messages for a predetermined timeout period be- 
fore the shutdown network node stops routing protocol 
messages. The predetermined timeout period is prefer- 
ably selected so that the other network nodes are able 
to detect that the network node has stopped sending or 
responding to "keep-alive" messages and begin routing 
protocol messages around the shutdown network node 
(i.e., reconverge) before the shutdown network node 
stops routing protocol messages, and therefore the pre- 
determined timeout period is typically greater than or 
equal to the timeout period defined by the particular rout- 
ing protocol used by the network nodes. Because the 
other network nodes have already begun routing proto- 
col messages around the shutdown network node by the 
time the shutdown network node has stopped sending 
or responding to "keep-alive" messages, no protocol 
messages are mis-routed to the shutdown network 
node, and therefore no protocol messages are dropped 
due to the shutdown network node. 
[0008] More particularly, when the shutdown network 
node decides to stop routing protocol messages, for ex- 
ample, upon being instructed to do so through a network 
management interface or upon detecting an internal fail- 
ure (for example, as described in the related US patent 
application entitled SYSTEM, DEVICE, AND METHOD 
FOR DETERMINING THE OPERATIONAL STATUS OF 
A NETWORK DEVICE, filed in the name of Bradley Cain 
on even date, the shutdown network node stops sending 
or responding to "keep-alive" messages, and starts a 
timer. The timer runs for the predetermined timeout pe- 
riod, which is typically set according to the timeout pe- 
riod for the particular routing protocol being used by the 
network nodes (for example, 30 seconds for OSPF). 
Once the timer expires, the shutdown network node can 
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safely stop routing protocol messages. 
[0009] After the shutdown network node stops send- 
ing or responding to "keep-alive" messages but before 
the other network nodes detect that the shutdown net- 
work node is no longer sending or responding to "keep- 5 
alive" messages, the other network nodes may route 
protocol messages to the shutdown network node, and 
the shutdown network node in turn routes the protocol 
messages as usual. 

[001 0] When the other network nodes detect that the 
shutdown network node is no longer sending or re- 
sponding to "keep-alive" messages, the other network 
nodes use the routing protocol to determine new routes 
that bypass the shutdown network node. In accordance 
with the present invention, the other network nodes use 
an existing routing protocol mechanism to detect that 
the shutdown network node is no longer sending or re- 
sponding to "keep-alive" messages. Therefore, the oth- 
er network nodes operate as usual, and require no logic 
changes in order to detect and bypass the shutdown 
network node. 

[0011] FIG. 1 is a logic flow diagram showing exem- 
plary logic 100 to stop routing protocol messages in ac- 
cordance with an embodiment of the present invention. 
Beginning at step 1 02, and upon deciding to stop routing 
protocol messages, in step 1 04, the logic stops sending 
or responding to "keep-alive" messages, in step 106, 
and starts a timer for a predetermined timeout period, 
in step 108. The logic continues routing protocol mes- 
sages during the timeout period, in step 110. Upon de- 
termining that the timer expired, in step 112, the logic 
stops routing protocol messages, in step 114. The logic 
100 terminates in step 199. 

[0012] FIG. 2 is a block diagram showing an exem- 
plary network node 200 in accordance with an embodi- 
ment of the present invention. The exemplary network 
node 200 includes, among other things, shutdown logic 
202, keep-alive logic 204, routing logic 206, and timer 
logic 208. The shutdown logic 202 is responsible for 
controlling the shutdown process for the network node 
200. The keep-alive logic 204 is responsible for sending 
or responding to"keep-alive" messages. The routing 
logic 206 is responsible for routing protocol messages. 
The timer logic 208 is responsible for timer functions 
within the network node 200. 

[0013] When the shutdown logic 202 decides to stop 
the routing logic 206 from routing protocol messages, 
for example, upon receiving an instruction through a net- 
work management interface (not shown) or upon detect- 
ing a failure, the shutdown logic 202 first signals the 
keep-alive logic 204 to stop sending or responding to 
"keep-alive" messages. The shutdown logic 202 then 
starts a timer for a predetermined timeout period using 
the timer logic 208. During the timeout period, the shut- 
down logic 202 signals or otherwise enables the routing 
logic 206 to continue routing protocol messages. When 
the shutdown logic 202 determines that the timer has 
expired, the shutdown logic 202 signals the routing logic 
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206 to stop routing protocol messages. 
[001 4] FIG. 3 is a system diagram showing an exem- 
plary communication system 300 in accordance with an 
embodiment of the present invention. The exemplary 
communication system 300 includes a plurality of inter- 
connected network nodes (302, 304, 306, 308, 310). 
The network node 304 is the shutdown network node. 
The network node 302 is coupled to the shutdown net- 
work node 304 over the interlace 303 and to the network 
node 31 0 over the interface 31 1 . The shutdown network 
node 304 is coupled to the network node 302 over the 
interface 303 and to the network node 306 over the in- 
terface 305. The network node 305 is coupled to the 
shutdown network node 304 over the interface 305 and 
to the network node 308 over the interface 307. The net- 
work node 308 is coupled to the network node 306 over 
the interface 307 and to the network node 31 0 over the 
interface 309. The network node 310 is coupled to the 
network node 308 over the interface 309 and to the net- 
work node 302 over the interface 311 . 
[001 5] During normal operation of the network, when 
the shutdown network node 304 is still routing protocol 
messages and the other network nodes (302, 306, 308, 
31 0) have not begun routing protocol messages around 
the shutdown network node 304, communications be- 
tween the network node 302 and the network node 306 
may be routed through the shutdown network node 304, 
since the path through the shutdown network node 304 
is the shortest path (at least by hop count). 
[0016] When the shutdown network node 304, and 
particularly the shutdown logic 202 in the shutdown net- 
work node 304, decides to stop routing protocol mes- 
sages, the shutdown network node 304 stops sending 
or responding to "keep-alive" messages, but continues 
routing protocol messages. As part of the normal routing 
protocol operation, the other network nodes (302, 306, 
308, 310) detect that the shutdown network node 304 
has stopped sending or responding to "keep-alive" mes- 
sages, and use the routing protocol to determine new 
routes that bypass the shutdown network node 304. In 
particular, communications between the network node 
302 and the network node 306 may be routed through 
the network node 308 and the network node 310 rather 
than through the shutdown network node 304. Once the 
other network nodes (302, 306, 308, 310) have begun 
routing protocol messages around the shutdown net- 
work node 304 (i.e., after a predetermined timeout pe- 
riod), the shutdown network node 304 stops routing pro- 
tocol messages. 

[0017] It should be noted that the "keep-alive" mes- 
sages referred to herein may be any type of signal that 
is used to indicate whether or not a particular network 
node is operational, either as part of a routing protocol 
or otherwise. When the shutdown network node stops 
sending or responding to "keep-alive" messages, it es- 
sentially signals to the other nodes that it is no longer 
operational, even though the shutdown network node 
continues routing protocol messages during the timeout 
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period. Thus, various embodiments of the present in- 
vention may use practically any mechanism for signal- 
ing that the shutdown network node is not operational. 
The present invention is in no way limited to the type of 
"keep-alive" message or the type of mechanism that is 
used by the other network nodes to detect that the shut- 
down network node has stopped sending or responding 
to "keep-alive" messages and begin routing protocol 
messages around the shutdown network node. 
[0018] In a preferred embodiment of the present in- 
vention, predominantly all of the shutdown logic and re- 
lated logic is implemented as a set of computer program 
instructions that are stored in a computer readable me- 
dium and executed by an embedded microprocessor 
system within the shutdown network node. Preferred 
embodiments of the invention may be implemented in 
any conventional computer programming language. For 
example, preferred embodiments may be implemented 
in a procedural programming language (e.g., tt C n ) or an 
object oriented programming language (e.g., "C++"). Al- 
ternative embodiments of the invention may be imple- 
mented using discrete components, integrated circuitry, 
programmable logic used in conjunction with a program- 
mable logic device such as a Field Programmable Gate 
Array (FPGA) or microprocessor, or any other means 
including any combination thereof. 
[0019] Alternative embodiments of the invention may 
be implemented as a computer program product for use 
with a computer system. Such implementation may in- 
clude a series of computer instructions fixed either on a 
tangible medium, such as a computer readable media 
(e.g., a diskette, CD-ROM, ROM, or fixed disk), or fixed 
in a computer data signal embodied in a carrier wave 
that is transmittable to a computer system via a modem 
or other interface device, such as a communications 
adapter connected to a network over a medium. The 
medium may be either a tangible medium (e.g., optical 
or analog communications lines) or a medium imple- 
mented with wireless techniques (e.g., microwave, in- 
frared or other transmission techniques). The series of 
computer instructions embodies all or part of the func- 
tionality previously described herein with respect to the 
system. Those skilled in the art should appreciate that 
such computer instructions can be written in a number 
of programming languages for use with many computer 
architectures or operating systems. Furthermore, such 
instructions may be stored in any memory device, such 
as semiconductor, magnetic, optical or other memory 
devices, and may be transmitted using any communica- 
tions technology, such as optical, infrared, microwave, 
or other transmission technologies. It is expected that 
such a computer program product may be distributed as 
a removable medium with accompanying printed or 
electronic documentation (e.g., shrink wrapped soft- 
ware), preloaded with a computer system (e.g., on sys- 
tem ROM or fixed disk), or distributed from a server or 
electronic bulletin board over the network (e.g., the In- 
ternet or World Wide Web). 



[0020] The present invention may be embodied in oth- 
er specific forms without departing from the essence or 
essential characteristics. The described embodiments 
are to be considered in all respects only as illustrative 
5 and not restrictive. 



Claims 

w 1. A shutdown method for a network node in a com- 
munication network, the method comprising: 

deciding to stop routing protocol messages; 
signaling that the network node is not opera- 
's tional upon deciding to stop routing protocol 
messages; and 

continuing to route protocol messages for a 
predetermined timeout period after signaling 
that the network node is not operational. 

20 

2. The shutdown method of claim 1 , wherein deciding 
to stop routing protocol messages comprises re- 
ceiving an instruction through a network manage- 
ment interface. 

25 

3. The shutdown method of claim 1 , wherein deciding 
to stop routing protocol messages comprises de- 
tecting an internal failure within the device. 

30 4. The shutdown method of claim 1 , wherein signaling 
that the network node is not operational comprises 
forcing the device to stop sending keep-alive mes- 
sages. 

35 5. The shutdown method of claim 1 , wherein signaling 
that the network node is not operational comprises 
forcing the device to stop responding to keep-alive 
messages. 

40 6. The shutdown method of claim 1 , wherein continu- 
ing to route protocol messages for the predeter- 
mined timeout period comprises: 

starting a timer for the predetermined timeout 
45 period; and 

continuing to route protocol messages while 
the timer is running. 

7. A device comprising: 

50 

signaling logic operably coupled to send sig- 
nals to indicate that the device is operational; 
routing logic operably coupled to route protocol 
messages; and 
55 shutdown logic operably coupled to decide to 

stop routing protocol messages and to prevent 
the signaling logic from sending the signals for 
a predetermined timeout period while enabling 
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the routing logic to continue routing protocol 
messages for the predetermined timeout peri- 
od upon deciding to stop routing protocol mes- 
sages. 

5 

8. A program product comprising a computer readable 
medium having embodied therein a computer pro- 
gram for performing a network anode shutdown, the 
computer program comprising: 

10 

signaling logic programmed to send signals to 
indicate that the device is operational; 
routing logic programmed to route protocol 
messages; and 

shutdown logic programmed to decide to stop *5 
routing protocol messages and to prevent the 
signaling logic from sending the signals for a 
predetermined timeout period while enabling 
the routing logic to continue routing protocol 
messages for the predetermined timeout peri- 20 
od upon deciding to stop routing protocol mes- 
sages. 

9. A communication system comprising a plurality of 
interconnected network nodes, wherein a first net- 25 
work node is operably coupled to signal that the first 
network node is not operational and to continue 
routing protocol messages for a predetermined 
timeout period after signaling that the first network 
node is not operational, and wherein a second net- 30 
work node is operably coupled to continue routing 
protocol messages to the first network node during 

at least a portion of the predetermined timeout pe- 
riod until the second network node determines that 
the first network node is not operational. 35 

10. A communication system comprising a plurality of 
interconnected nodes, wherein each of the nodes 
comprises: 

40 

means for determining when the node is not op- 
erational; and 

means for forwarding protocol messages for a 
predetermined time period after determining 
that the node is not operational. 45 
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